Annotation Guidelines for the Chinese Proposition Bank
نویسنده
چکیده
The purpose of the Chinese PropBank (CPB) project is to add a layer of annotation to the hand-parsed sentences in the Chinese Treebank (CTB) (Xue et al., 2005). This layer of annotation assigns predicatespecific argument labels to the constituents in a parse tree. The arguments of each predicate in the sentence, which are limited to verbs and their nominalizations in the work we report here, receive an argument label in the form of ArgN, where N is an integer between 0 and 5. These numbered arguments represent core arguments that are defined in relation to the predicate, which is labeled as Rel. Each core argument plays a unique role with regard to the predicate and generally the total number of core arguments for each predicate does not exceed 6. The core arguments annotated for the verb N (”investigate”) in Example (1) are the NPs ́(”the police”) and ̄ (”accident”) Ï(”cause”), which are labeled as Arg0 and Arg1 respectively. The semantic role labels added to the parse tree are in bold.
منابع مشابه
A Parallel Proposition Bank II For Chinese And English
The Proposition Bank (PropBank) project is aimed at creating a corpus of text annotated with information about semantic propositions. The second phase of the project, PropBank II adds additional levels of semantic annotation which include eventuality variables, co-reference, coarse-grained sense tags, and discourse connectives. This paper presents the results of the parallel PropBank II project...
متن کاملAutomatic Semantic Role Labeling for Chinese Verbs
Recent years have seen a revived interst in semantic parsing by applying statistical and machinelearning methods to semantically annotated corpora such as the FrameNet and the Proposition Bank. So far much of the research has been focused on English due to the lack of semantically annotated resources in other languages. In this paper, we report first results on semantic role labeling using a pr...
متن کاملA Proposition Bank of Urdu
This paper describes our efforts for the development of a Proposition Bank for Urdu, an Indo-Aryan language. Our primary goal is the labeling of syntactic nodes in the existing Urdu dependency Treebank with specific argument labels. In essence, it involves annotation of predicate argument structures of both simple and complex predicates in the Treebank corpus. In this paper, we describe the ove...
متن کاملA Semi-Automatic Method For Annotating A Biomedical Proposition Bank
In this paper, we present a semiautomatic approach for annotating semantic information in biomedical texts. The information is used to construct a biomedical proposition bank called BioProp. Like PropBank in the newswire domain, BioProp contains annotations of predicate argument structures and semantic roles in a treebank schema. To construct BioProp, a semantic role labeling (SRL) system train...
متن کاملAdding Semantic Annotation to the Penn TreeBank
This paper presents our basic approach to creating Proposition Bank, which involves adding a layer of semantic annotation to the Penn English TreeBank. Without attempting to confirm or disconfirm any particular semantic theory, our goal is to provide consistent argument labeling that will facilitate the automatic extraction of relational data. An argument such as the window in John broke the wi...
متن کامل